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The internet of things (IoT) is an ecosystem of connected objects that are 
accessible and available through the internet. This "thing" in the IoT could 
be a sensor such as a heart monitor, temperature, and oxygen rate in the 
blood. These sensors produce huge amounts of information that lead to 
congestion and an effect on bandwidth in the IoT network. In this paper, the 
proposed system is based on the Zstandard compression algorithm to 
compress the sensor data to minimize the amount of data transmitted from 
the IoT level to the fog level and decrease network overloading. The 
proposed system was evaluated using compression ratio, throughput, and 
latency time for healthcare applications. The result showed better calculation 
through decreased response time and increased throughput for transmitted 


IoT ; data compared with the case of non-compressed data. It showed the 
Raspberry Pi compression data ratio about 70% of orignial data, maximum number of IoT 
Zstandard algorithm sensor reads as 100, throughput is 85.43 B/ms, and fog processing delay is 
6.25 ms. 
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1. INTRODUCTION 

The internet of things (IoT) is a platform where virtual and physical goods are connected and 
interact with one another, according to its most basic description. The internet of things make use of a variety 
of technologies, including cloud computing, wireless sensor networks, and embedded intelligence, to 
accomplish its goals [1], [2]. Environmental data is collected by internet of things devices via the use of radio 
frequency identifier (RFID), cameras, sensors, and other technologies [3], [4]. Some of the complex 
capabilities offered with these systems include remote monitoring, online analytics, and remote management. 
Remote monitoring applications based on the IoT are being utilized in several disciplines, ranging from 
healthcare to smart manufacturing, as well as smart homes, smart cities, smart agriculture, and among other 
areas [5], [6]. 

Fog computing is an expanded architecture of cloud computing that leverages edge devices to 
provide computation, storage, analysis, and communication between smart end devices and cloud computing 
data centers. As a result, processing and intelligence are brought closer to the point of origin [7]. Fog 
computing is becoming more important for IoT because of its ability to provide quick response and low 
latency between the cloud and IoT levels. Fog computing may be comprised of a large number of nodes with 
a widely dispersed geographical distribution and it is distinguished by its mobility and spatial awareness [8]. 

Data compression is a critical component of contemporary digital communication; it reduces the 
number of bits required to encode data from its original size [9], [10]. Data transmission is greatly reduced at 
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the source end and then recovery is assisted by this method. Using this method, a significant amount of 
energy and bandwidth is saved [11], [12]. There are many ways to compress data, but they may be broadly 
categorized into two types: lossless techniques and lossy methods. By finding and removing statistical 
redundancy, lossless compression minimizes the number of bits needed to encode a given file size. Lossless 
compression ensures that no data is lost. By deleting unneeded or less essential data, lossy compression 
decreases the number of bits [13], [14]. IoT environment produce huge amounts of information and IoT 
networks do not have sufficient bandwidth due to constrained resources [15]. To save IoT network bandwidth 
and prevent network congestion issues, and network overhead, so the IoT data must be compressed prior to 
transmission to decrease size of transmitted data [16], [17]. 

To meet the latency, traffic reduction, privacy, and bandwidth requirements of IoT applications, the 
IoT network must be able to respond quickly to data from these apps. The smart healthcare application, for 
example, requires strong security and privacy for patient data, a rapid reaction to emergency status, and high 
bandwidth for uploading the enormous quantity of daily sensed patient data over the IoT network, among 
other requirements [18]. The present IoT network is unable to meet these demands due to bandwidth 
restrictions, an unavoidable lengthy delay, and the high cost of uploading such a large volume of information 
[19]. Therefore, the proposed system works in an efficient way to minimize the amount of data transmitted 
from the IoT level to the fog level and decrease network overloading using one of data compression 
algorithm in real experiment. 

There are different related works associated with the IoT healthcare data compression as follow: a 
novel lossless electroencephalographic (EEG) data compression strategy has been developed in [20], it is 
enabled by fog computing. The method is being developed to decrease the amount of EEG patient data that is 
received at the fog gateway before it is sent to a cloud platform. The EEG data compression technique is 
divided into two phases: clustering and compression. First, using agglomerative hierarchical clustering, the 
incoming data is organized into groups of similar data. In the second step, the Huffman encoding is applied to 
each of the clusters that have been generated. The compressed data from smaller clusters are concatenated 
and sent from fog to cloud at the end of the process. Using hierarchical clustering and Huffman encoding 
(HCHE), the researchers discovered that the proposed strategy performed much better in terms of EEG data 
reduction size and compression ratio by taking advantage of similarities between data from different clusters. 

Management of data traffic has been proposed in [21] for compression and minimum description 
length (MDL). The goal of the proposed is to reduce the energy consumption of each node and prolong the 
wireless sensor network’s (WSN’s) lifespan while maintaining acceptable data accuracy. In the first place, a 
lightweight lossless compression method based on differential encoding and Huffman techniques are used, 
which is especially useful for IoT nodes that monitor the environment and have limited computational and 
memory resources. A hierarchical clustering approach was used in the second level of MDL to cluster 
datasets that had been collected in the first level. The performance of the sensors is evaluated based on the 
real data collected by the sensors and the use of the objective modular network testbed in C++ (OMNeT++) 
simulator. In terms of results, a reduction in the volume of data submitted to the GW of up to 79-93.3 percent 
was achieved, as was a reduction in the energy expended of up to 76-82 percent. A compression ratio of up to 
87-93 percent was achieved. 

The compression-based data reduction (CBDR) approach has been proposed in [22] to operate at the 
level of internet of things sensor nodes. The approach, which is being developed to reduce data transmissions 
in the internet of things sensor networks, has the potential to save a significant amount of energy and prolong 
its lifetime. It incorporates two phases of compression: a lossy symbolic aggregate approximation (SAX) 
quantization stage that minimizes the dynamic range of sensor data readings and a lossless Lempel-Ziv— 
Welch (LZW) compression step that compresses the output of the loss quantization stage. Another 
modification to the CBDR approach has been proposed, which is the addition of a dynamic transmission 
component (DT-CBDR). It is utilizing the OMNeT++ simulator in conjunction with actual sensory data 
collected at Intel Lab. Its findings revealed that CBDR and DT-CBDR protocols outperformed PFF and Harb 
protocols in terms of workload reduction, with up to 79 percent and 80 percent reductions in the amount of 
data collected, 74 percent to 80 percent reductions in the amount of data transmitted, and 78 percent 
reduction in the amount of energy used, while CBDR and DT-CBDR techniques achieve high compression 
ratios (above 95 percent). 

Passos et al. [23] proposed energy-efficient local data compression for transmission over wireless body 
area networks (WBAN) (GROWN) as a hybrid technique to local data compression. The technology is being 
developed to reduce information redundancy during data transmission, conserve energy, and extend the lifespan 
of wearable devices in wireless body networks, among other things. It is called GROWN because it coordinates 
compression with the kind of data signal gathered, minimizing redundant data transmission and so lowering 
energy consumption by devices in the network. A threshold is used to restrict the allowable data losses that may 
occur during transmissions and lossless compression is used by encoding tables, which normalize the data 
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format to reduce how much information is sent over a network connection. GROWN has achieved an energy 
efficiency of up to 53.73 percent with a maximum delay of 55 milliseconds, according to the company. 

Hossain and Roy [24] proposed a two-layered data compression architecture for the internet of 
things data is presented, which decreases the quantity of data transmitted while maintaining a low error rate 
and avoiding bandwidth wasting. A lossy compression framework has been developed for sensor data and 
attempts have been made to reduce the error rate in the lossy compression process. A pair of algorithms have 
been developed that operate primarily at two layers: the fog and the cloud nodes. A collection of IoT sensor 
data is collected by the system, which then compresses and saves it in the cloud. When the user requests the 
data, it decompresses and performs specific operations to dispense the right information to him or her. The 
results show that at the fog nodes, the data was compressed by 50%. In cloud storage, we have compressed 
the data up to 90%. Even after decompression, we found that there is a small difference between the original 
data and the data after it has been uncompressed. 

Shrividhiya et al. [25] propose hybrid compression, which is named as “data compression algorithm 
utilizing LZW framework based on Huffman technique to offer better compression ratio. The Huffman 
algorithm is used to compress the data, and the output of this process is transferred to the LZW algorithm, 
which produces the final result (Huffman-based LZW for data compression). The final compressed ratio and 
saving for all of the input data is determined at the conclusion of two compressions. The output shows five 
distinct inputs with varying storage capacities ranging from a few bytes to 5 kilobytes. This hybrid technique 
achieves a compression ratio of 2.417, a compression factor of 0.425, and a saving percentage of 57.7 percent 
as a result. The findings were compared to the well-known LZW and Huffman algorithms, and the results 
revealed that the results were more efficient and less susceptible to mistakes than these two algorithms. 
Furthermore, this hybrid method performs better when dealing with large amounts of data. 

In addition to the introduced of this paper, the used system based on a real-time compression 
method for internet of things data gathered by many IoT devices using fog computing has been adopted. The 
Zstandard compression algorithm is used for IoT sensor data. It will compress data and send to the fog for 
decompression. It evaluated using compression ratio, throughput, and latency time for healthcare 
applications. The paper can be outlined in the following way: 1) introduction, 2) the proposed approach, 
3) method, 4) results and discussion, and 5) conclusion. 


2. THE PROPOSED APPROACH 

The proposed approche is based on the Zstandard algorithm. It is mainly used for compression 
purposes in the proposed system. It works at the IoT sensors level to compress their readings in an efficient 
way to minimize the amount of data transmitted from the IoT level to the fog level and decrease network 
overloading. The general view of the proposed system showed in Figure 1. The flowchart of the system is 
explained in Figure 2. The main steps of the proposed approach are as shown in: 


4-Evaluation Data 


3- Decompression Data Fog Layer 


Response 


Request 


2- Compression Data 
using Zstandard Method 


1- Read Data from sensor 


Figure 1. The general view of the proposed system 
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Figure 2. The flowchart of the proposed system 


2.1. System implementation steps 


The implementation steps of the proposed approach are sorted as shown in: 
a. Read sensor data 

Initializing the sensors (temperature, heart rate, oxygen level) read data from the patient’s body. The 
Raspberry Pi (RPi) received data from sensors, and aggregation sensing information as data messages from 
IoT sensors in the Raspberry Pi such as temperature, heart rate, and oxygen level sensors. 
b. Compression phase (IoT layer using Zstandard algorithm) 


The compression process in Raspberry Pi compresses the data collected by sensors using Zstandard 
algorithm to result in compression data (CD). The compression data on the Raspberry Pi will be sent through 


a wireless connection by the socket (IP address) to the server, as shown in Figure 3 implementation of the 
proposed system. 
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Figure 3. Implementation of the proposed compression algorithm 


c. Decompression phase (fog layer using Zstandard algorithm) 


The server will receive a CD form the Raspberry Pi. It will run decompression process to 


decompress the CD by using a Zstandard algorithm that returns data message to its original form. The data is 
the analysis and store the data in the server. 
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2.2. System specifications 

The proposed system components include the IoT devices as Raspberry Pi, a temperature sensor, a 
heart rate sensor, an oxygen sensor of the network data devices, and the management device as a fog server 
(computer). Table 1 contains information on the specifications of the used network devices. The proposed 
system is based on the Raspberry Pi as a fundamental construction component due to its characteristics: it is 
tiny, inexpensive, powerful, and completely adaptable. In addition, it's a programmable tiny computer board 
that comes pre-loaded with support for a wide variety of input and output ports, as well as a network 
connection. It is used in the proposed system for monitoring and controlling sensing devices. It has the 
process of compression of data gathered by IoT sensors. 

Besides, the temperature sensor in the proposed system senses the temperature between -55 and 
+125 degrees. Furthermore, the heart rate sensor is used for sensing heart pluses and capturing them as data 
messages. Alongside, the oximeter sensor is used to sense the rate of oxygen and heart rate in the human 
body, in the last the fog server device is used for monitoring and controlling the entire network traffic. 
Furthermore, all sensing information from IoT sensors (temperature degree, heart rate, and oxygen level) is 
collected on the Raspberry Pi, which then compresses this sensing information and sends it to the server. 

The proposed system implemented with Python programming language, it is used on both the 
Raspberry Pi and the server devices. Python is a computer language that is useful for a wide range of tasks 
and is both adaptable and powerful. Because it is succinct and simple to understand, it makes an excellent for 
fog computing implementations. Python can do any task you set for it. Python is a programming language 
that may be used for a variety of tasks, including resource management, data analysis, data compression, and 
different approaches for network data analysis. 


Table 1. Specification of devices nodes 


Device type Device model Supply voltage Product size 
Raspberry Pi 4 model B 8 GB DC 5 V/2.5 A 82 mm x 56 mm x 19.5 mm, 50 g 
Temperature sensor DS18B20 waterproof digital thermal probe 1 m 3.0 V to 5.5 V 6*50 mm 
A sensor of oxygen Pulse Oximeter (SPO2) sensor MAX30100 3.3 V 5.6 mm x 2.8 mm x 1.2 mm 
and heart rate 
Fog server Dell Inspiron 5559 19.5 V 


2.3. System evaluation metric 
The proposed system is evaluated with the main metrics as shown in: 
a. Data compression ratio (DCR): it is used in the proposed system to for measured how much data 
representation can be compressed before being transferred. The most common way to describe this is as 
the ratio of compressed to uncompressed data size. 


size of data—size of Compressed data 


DCR (%) = ( ) * 100% (1) 


size of data 


b. Throughput: it is used in the proposed system to measure maximum quantity of data that can be sent from 
source to destination in a particular length of time through an internet connection. 


Ee of send saray 


Time 


Throughput = (2) 

c. Latency (delay): it is used in the proposed system to measurement of the amount of time it takes for data 
to travel across a network to its destination. As a result, latency has a significant influence on the overall 
performance of the network. It is customary to quantify latency in milliseconds (ms). 


3. METHOD 

This algorithm is kind of a lossless compression algorithm; it ensures that all data is sent without 
any loss. It is a high-compression-ratio method that provides both quick and efficient features. It also has a 
particular mode for small amounts of data, known as "dictionary compression," that may be used. There is a 
very broad range of speed/compression trade-offs available in the reference library, which is supported by a 
very fast decoder. Algorithm 1 describes the process of compressing data readings. Besides the 
decompression messages process showed in Algorithm 2. 
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Algorithm 1. Zstandard algorithm compression 
Initialize table with single character strings 
P = first input character 
WHILE not end of input stream 
C = next input character 
IF P + C is in the string table 
P=P+C 
E 
o 
a 


LSE 
utput the code for P 
dd P + C to the string table 


END WHILE 
output code for P 


Algorithm 2. Zstandard algorithm decompression 
Initialize table with single character strings 
OLD = first input code 

output translation of OLD 

WHILE not end of input stream 

NEW = next input code 

F NEW is not in the string table 

= translation of OLD 

=5+C 

LSE 

= translation of NEW 

utput 5 

= first character of S 

LD + C to the string table 

LD = NEW 

D WHILE 


ZOOQOUNRANUH 


ti 


4. RESULTS AND DISCUSSION 

The proposed fog computing system result is based on the main network structure, as it analyzed 
between the IoT, and fog layers of the fog computing. Furthermore, the proposed system results are based on 
the two case studies as the first case study is based on the without compressed data while the second one is 
based on the case of compressed data with the Zstandard algorithm. 


4.1. The 1* case study of (without compressed) method 

In this case, the proposed system was implemented with Python programming language to evaluate 
the fog network performance based on the throughput parameter as shown in Table 2. The 1* case study with 
number of IoT sensor reads as 20, 50, and 100 with different input data sizes as 273 B, 671 B, and 1338 B 
respectively. It showed that the throughput increased with increased number of reads. Table 3 shows the fog 
processing delay for each IoT sensor reads. Figure 4 shows the 1* case study among evaluation metrics. 


Table 2. Throughput of the 1* case study Table 3. Latency of the 1“ case study 
Number of IoT Data size Execution’s Throughput Number of IoT sensor Fog processing delay 
sensor reads (Byte) time (msec) (Byte/msec) reads (msec) 
20 273 12.28 22.23 20 6.25 
50 671 18.77 35.74 50 7.81 
100 1338 39.45 33.91 100 9.37 
60 
40 X 
20 N 
W 
Wie sE NIe 
7 N= Nie NE 
1 2 3 
m Data size (KB) « Execution’s time (ms) 
|| Throughput(B/ms) = Fog Processing Delay (msec) 


Figure 4. System parameters for the 1* case study 


Design an efficient internet of things data compression for healthcare applications (Ahmed Najah Kahdim) 


1684 O ISSN: 2302-9285 


4.2. The 2" case study of (with compression) method 
In this case study the proposed system is based on the Zstandard compression algorithm to compressed 


input sensor data due to the IoT limitation device and fewer resources so the overloaded in the network will 
affect directly on the network performance, so this algorithm decrease data size and network delay through 
transmitting data from the source IoT sensors to the final destination (data management server). Table 4 shows 
the data compression ratio of the 2"4 case study with a different number of reads. Table 5 showed the throughput 
value of the 2" case study based on the compressed sensor data. In addition, Table 6 showed the latency of the 
data processing in the fog system is calculated depending on the different input reads. 


Table 4. Data compression ratio of the 2™ case study Table 5. Throughput of the 2™! case study 
Size of Size of Number of Data Pane 
NumberotloT original data compression data DCR IoT sensor size Execution 5 Throughput 
sensor reads (Byte) (Byte) reads (Byte) time (ms) (Byte/ms) 
20 273 115 57% 20 115 3.9 ms 29.48 
50 671 219 67% 50 219 4.3 ms 50.93 
100 1338 393 70% 100 393 4.6 ms 85.43 


Table 6. Latency of the 2"! case study 


Number of IoT sensor reads Fog processing delay (ms) 
20 6.25 ms 
50 6.25 ms 
100 6.25 ms 


Figure 5 showed the 2™ case study among evaluation metrics. The Zstandard compression algorithm 
used in the proposed system due to the flexibility of use, easy to setting, and less overhead, as well as less 
computation power (CPU, RAM, and execution time) resources to executed and it was the better choice to 
work in IoT resource limitation environment compared with other compressed algorithms such as LWZ 
algorithm which effects on the system resource through executing processes. It showed decrease response 
time and increased throughput for transmitted data compared with the case of non-compressed data. 


100 


50 
| Y 
0 Ge — Vj > 


NS 


SS 
QV 


20 Reads 50 Reads 100 Reads 
E Data size (KB) z Execution’s time (ms) 
Z Throughput(B/ms) = Fog Processing Delay (ms) 


Figure 5. System parameters for the 2™ case study 


4.3. The 3" case study of (system comparison) 
The proposed system compared with different related work based on the compression of IoT sensing data 


and it showed that the proposed system provides better results in compressed data size and ratio value, as compression 
data size is 881 B, and the compressed ratio is 70.6 B. Table 7 showed comparison with other related work. 


Table 7. The comparison with other related work 
Original data size in | Compression data size 


Programming E P 

Refences.No language Algorithm bytes in bytes Ratio 
2021 [25] Python Huffman based LZW 3000 1072 64% 

The proposed system Python Zstandard 3000 881 70% 
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5. CONCLUSION 

By using the Zstandard compression technique, the suggested system may reduce the quantity of 
data sent from the IoT device to the fog server significantly. The hardware used in this proposal is the 
Raspberry Pi 4 model B and two types of sensors: DS18B20 (temperature sensor-waterproof) and 
MAX30102 (pulse oximeter and heart-rate sensor). These sensors read information and send data to the 
Raspberry Pi. The programming language used in the paper is Python on the server and IoT devices used the 
Zstandard compression algorithm in the proposed system due to the flexibility of use, ease of setting, and less 
overhead, as well as fewer computation power (CPU, RAM, and execution time) resources required to 
execute. The proposed system was evaluated using compression ratio, throughput, and latency time for 
healthcare applications. The compression ratio is set when a file size of 3000 bytes is compressed to 881 
bytes at a rate of 70%, which provides better results compared with different related work. For further study, 
we suggest combine data compression with cryptographic for more security and speed in transmitted data. 
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